AITopics | Accra

Accra

Revealing Geography-Driven Signals in Zone-Level Claim Frequency Models: An Empirical Study using Environmental and Visual Predictors

Alfonso-Sánchez, Sherly, Bravo, Cristián, Stankova, Kristina G.

arXiv.org Machine Learning, Apr-24-2026

Geographic context is often considered relevant to motor insurance risk, yet public actuarial datasets provide limited location identifiers, constraining how this information can be incorporated and evaluated in claim-frequency models. This study examines how geographic information from alternative data sources can be incorporated into actuarial models for Motor Third Party Liability (MTPL) claim prediction under such constraints. Using the BeMTPL97 dataset, we adopt a zone-level modeling framework and evaluate predictive performance on unseen postcodes. Geographic information is introduced through two channels: environmental indicators from OpenStreetMap and CORINE Land Cover, and orthoimagery released by the Belgian National Geographic Institute for academic use. We evaluate the predictive contribution of coordinates, environmental features, and image embeddings across three baseline models: generalized linear models (GLMs), regularized GLMs, and gradient-boosted trees, while raw imagery is modeled using convolutional neural networks. Our results show that augmenting actuarial variables with constructed geographic information improves accuracy. Across experiments, both linear and tree-based models benefit most from combining coordinates with environmental features extracted at 5 km scale, while smaller neighborhoods also improve baseline specifications. Generally, image embeddings do not improve performance when environmental features are available; however, when such features are absent, pretrained vision-transformer embeddings enhance accuracy and stability for regularized GLMs. Our results show that the predictive value of geographic information in zone-level MTPL frequency models depends less on model complexity than on how geography is represented, and illustrate that geographic context can be incorporated despite limited individual-level spatial information.

artificial intelligence, machine learning, postcode 2, (18 more...)

arXiv:2604.21893

Country:

South America > Colombia (0.04)
Europe > Belgium > Flanders > Antwerp Province > Antwerp (0.04)
Asia > Bangladesh (0.04)
(8 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Health & Medicine (1.00)
Banking & Finance > Insurance (1.00)
Transportation > Ground > Road (0.93)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Spatial Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.87)

94cb02feb750f20bad8a85dfe7e18d11-Supplemental.pdf

Neural Information Processing Systems, Feb-9-2026, 09:43:30 GMT

algorithm, qsparse-local-sgd, synchronization, (16 more...)

Country:

North America > United States > Illinois > Champaign County > Urbana (0.04)
North America > Canada (0.04)
Africa > Ghana > Greater Accra > Accra (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

29c0605a3bab4229e46723f89cf59d83-Paper.pdf

Neural Information Processing Systems, Feb-7-2026, 22:12:27 GMT

classifier, fairness, group fairness, (13 more...)

Country:

Africa > Ghana > Greater Accra > Accra (0.04)
North America > United States > Georgia > Fulton County > Atlanta (0.04)
Europe > Sweden > Stockholm > Stockholm (0.04)
(3 more...)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.47)

Training and Testing with Multiple Splits: A Central Limit Theorem for Split-Sample Estimators

Fava, Bruno

arXiv.org Machine Learning, Nov-10-2025

As predictive algorithms grow in popularity, using the same dataset to both train and test a new model has become routine across research, policy, and industry. Sample-splitting attains valid inference on model properties by using separate subsamples to estimate the model and to evaluate it. However, this approach has two drawbacks: each task uses only part of the data, and different splits can lead to widely different estimates. Averaging across multiple splits, I develop an inference approach that uses more data for training, uses the entire sample for testing, and improves reproducibility. I address the statistical dependence from reusing observations across splits by proving a new central limit theorem for a large class of split-sample estimators under arguably mild and general conditions. Importantly, I make no restrictions on model complexity or convergence rates. I show that confidence intervals based on the normal approximation are valid for many applications, but may undercover in important cases of interest, such as comparing the performance of two models. I develop a new inference approach for such cases, explicitly accounting for the dependence across splits. Moreover, I provide a measure of reproducibility for p-values obtained from split-sample estimators. Finally, I apply my results to two important problems in development and public economics: predicting poverty and learning heterogeneous treatment effects in randomized experiments. I show that my inference approach with repeated cross-fitting achieves better power than previous alternatives, often enough to find statistical significance that would otherwise be missed.

artificial intelligence, machine learning, probability, (19 more...)

arXiv:2511.04957

Country:

Asia > India (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
(2 more...)

Genre: Research Report > Experimental Study (1.00)

Industry: Education > Educational Setting (0.45)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Data Science (0.87)

Fair Performance Metric Elicitation

Neural Information Processing Systems, Oct-3-2025, 08:48:06 GMT

What is a fair performance metric?

classifier, metric, procedure, (12 more...)

Country:

North America > United States > Illinois (0.04)
North America > Canada (0.04)
Asia > Middle East > Jordan (0.04)
Africa > Ghana > Greater Accra > Accra (0.04)

Genre: Research Report (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Data Science > Data Mining (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.47)

94cb02feb750f20bad8a85dfe7e18d11-Supplemental.pdf

Neural Information Processing Systems, Aug-15-2025, 04:25:14 GMT

algorithm, qsparse-local-sgd, synchronization, (14 more...)

Country:

North America > United States > Illinois > Champaign County > Urbana (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Africa > Ghana > Greater Accra > Accra (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

94cb02feb750f20bad8a85dfe7e18d11-Paper.pdf

Neural Information Processing Systems, Aug-15-2025, 04:25:07 GMT

algorithm, compressor, qsparse-local-sgd, (14 more...)

Country:

North America > United States > Illinois > Champaign County > Urbana (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Africa > Ghana > Greater Accra > Accra (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.69)

Integrating mobile and fixed monitoring data for high-resolution PM2.5 mapping using machine learning

Xu, Rui, Yao, Dawen, Pian, Yuzhuang, Cao, Ruhui, Fu, Yixin, Yang, Xinru, Gan, Ting, Liu, Yonghong

arXiv.org Artificial Intelligence, Mar-16-2025

Constructing high-resolution air pollution maps at lower cost is crucial for sustainable city management and public health risk assessment. However, traditional fixed-site monitoring lacks spatial coverage, while mobile low-cost sensors exhibit significant data instability. This study integrates PM2.5 data from 320 taxi-mounted mobile low-cost sensors and 52 fixed monitoring stations to address these limitations. Using machine learning methods, a mapping relationship was established between fixed and mobile monitoring concentrations. The resulting pollution maps achieved 500-meter spatial and 5-minute temporal resolutions, showing close alignment with fixed monitoring data (+4.35% bias) but significant deviation from raw mobile data (-31.77%). The fused map exhibits the fine-scale spatial variability also observed in the mobile pollution map, while showing stable temporal variability closer to that of the fixed pollution map (fixed: 1.12 ± 0.73%, mobile: 3.15 ± 2.44%, mapped: 1.01 ± 0.65%). These findings demonstrate the potential of large-scale mobile low-cost sensor networks for high-resolution air quality mapping, supporting targeted urban environmental governance and health risk mitigation.

artificial intelligence, concentration, machine learning, (17 more...)

arXiv:2503.12367

Country:

Asia > China > Guangdong Province > Guangzhou (0.05)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
North America > United States (0.04)
(8 more...)

Genre: Research Report > New Finding (0.87)

Industry:

Transportation > Ground > Road (0.93)
Health & Medicine > Consumer Health (0.74)
Health & Medicine > Therapeutic Area (0.68)

Technology:

Information Technology > Sensing and Signal Processing (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Spatial Reasoning (0.68)
Information Technology > Communications > Networks > Sensor Networks (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.47)

Predicting House Rental Prices in Ghana Using Machine Learning

Adzanoukpe, Philip

arXiv.org Artificial Intelligence, Jan-8-2025

The housing market in Ghana has been facing significant challenges, with the rental sector being particularly affected by issues such as the advance rent system, asymmetrical perceptions between landlords and tenants, and the lack of an institutional framework for regulating the market [2]. These challenges create a highly dynamic and often opaque rental environment, where both tenants and landlords face difficulties in determining fair rental prices. This issue is further exacerbated by the absence of comprehensive and up-to-date data on rental trends, making it challenging for stakeholders to make informed decisions. In recent years, the use of machine learning in real estate has gained traction globally as a means to address such challenges. Machine learning (ML) models can analyse large datasets, uncover hidden patterns, and make accurate predictions, thereby providing valuable insights for various stakeholders in the housing market.

ghana, prediction, rental price, (14 more...)

doi: 10.20944/preprints202412.1927.v1

arXiv:2501.06241

Country:

Africa > Ghana > Greater Accra > Accra (0.04)
Africa > Ghana > Ashanti > Kumasi (0.04)
Asia > Pakistan > Sindh > Karachi Division > Karachi (0.04)

Genre: Research Report > New Finding (0.47)

Industry: Banking & Finance > Real Estate (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.72)

Susu Box or Piggy Bank: Assessing Cultural Commonsense Knowledge between Ghana and the U.S.

Acquaye, Christabel, An, Haozhe, Rudinger, Rachel

arXiv.org Artificial Intelligence, Oct-22-2024

Recent work has highlighted the culturally-contingent nature of commonsense knowledge. We introduce AMAMMERε, a test set of 525 multiple-choice questions designed to evaluate the commonsense knowledge of English LLMs, relative to the cultural contexts of Ghana and the United States. To create AMAMMERε, we select a set of multiple-choice questions (MCQs) from existing commonsense datasets and rewrite them in a multi-stage process involving surveys of Ghanaian and U.S. participants. In three rounds of surveys, participants from both pools are solicited to (1) write correct and incorrect answer choices, (2) rate individual answer choices on a 5-point Likert scale, and (3) select the best answer choice from the newly-constructed MCQ items, in a final validation step. By engaging participants at multiple stages, our procedure ensures that participant perspectives are incorporated both in the creation and validation of test items, resulting in high levels of agreement within each pool. We evaluate several off-the-shelf English LLMs on AMAMMERε. Uniformly, models prefer answer choices that align with the preferences of U.S. annotators over Ghanaian annotators. Additionally, when test items specify a cultural context (Ghana or the U.S.), models exhibit some ability to adapt, but performance is consistently better in U.S. contexts than Ghanaian. As large resources are devoted to the advancement of English LLMs, our findings underscore the need for culturally adaptable models and evaluations to meet the needs of diverse English-speaking populations around the world.

computational linguistic, large language model, machine learning, (20 more...)

arXiv:2410.16451

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > Mexico (0.04)
Asia > Thailand > Bangkok > Bangkok (0.04)
(12 more...)

Genre:

Research Report > New Finding (1.00)
Questionnaire & Opinion Survey (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.32)
